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On the convergence of a Risk Sensitive like Filter 

Mattia Zorzi, Bernard C. Levy 


Abstract —In this paper, we analyze the convergence of a risk 
sensitive like filter where the risk sensitivity parameter is time 
varying. Such filter has a Kalman like structure and its gain 
matrix is updated according to a distorted version of the Riccatl 
iteration. We show that the iteration converges to a fixed point 
hy using the contraction analysis. 

I. Introduction 

Physical systems are often modeled by (nominal) lin¬ 
ear models. One reason is that the corresponding filtering 
problem is tractable, for instance if we consider the Gauss- 
Markov state space model then we obtain the Kalman filter. 
On the other hand, linear models are rather simple and thus 
inttoduce modeling errors. This implies that the optimal filter 
may not perform well in the realty. 

One possible strategy to deal with such problem is to use 
robust filtering. The pioneering works are due by Kassam, 
Poor and their collaborators, [10],[14]. This paradigm can 
be sketched as follows. One player (say, nature) selects the 
least favorable model in an allowable neighborhood about the 
nominal model, while the other player designs the optimal 
filter based on that least favorable model. Therefore, the 
optimal filter is obtained by solving a minimax problem. 
However, the implementation of such a filter can be very 
difficult because it depends on the characterization of the 
allowable neighborhood. To overcome this difficulty, a new 
class of robust filters based on the minimization of risk 
sensitive functions, which penalize large estimation errors, 
was inttoduced in [15],[17],[2]. The sensitivity to large 
errors is tuned by a risk sensitivity parameter. This approach 
considers Gauss-Markov state space models and the resulting 
robust filter is a Kalman like filter where the gain matrix 
is updated according to a distorted version of the Riccati 
iteration (say, risk sensitive Riccati iteration). Unfortunately, 
this method only considers the nominal model. In [12], 
a new minimax robust state space filtering problem was 
examineted. In this approach, at each time step all possible 
increments of the state space model are described by a ball 
about the nominal increment. Its radius is fixed a priori 
and represents the tolerance budget available at each time 
step. Therefore, the nature selects the least favorable model 
increment in the allowable ball, and the other player designs 
the optimal filter based on that least favorable model. It turns 

This work has been partially supported by the FIRB project “Learning 
meets time” (RBFR12M3AC) funded by MIUR. 

M. Zorzi is is with the Dipartimento di Ingegneria dellTnformazione, 
Universita di Padova, via Gradenigo 6/B, 35131 Padova, Italy, 

(zorzimat@dei,unipd.it) 

B. Levy is with the Department of Electrical and Computer Engi¬ 
neering, 1 Shields Avenue, University of California, Davis, CA 95616 
(bclevy@ucdavis.edu) 


out the resulting robust filter is a risk sensitive like filter 
where the risk sensitivity parameter is now time varying. 
Accordingly, the gain matrix updating is governed by a risk 
sensitive like Riccati iteration. 

An important issue for Kalman like filters is their conver¬ 
gence. In [3], under the assumption that the Gauss-Markov 
state space model is reachable and observable, it has been 
established that the Riccati mapping is a contraction for 
the Riemann metric associated to the cone of positive defi¬ 
nite matrices, and thus the Riccati iteration asymptotically 
converges. The same result can be proved by using the 
Thompson part metric [11], [5]. In [13], a similar contraction 
analysis has been considered to prove the convergence of the 
risk sensitive Riccati iteration. Here, the problem has been 
formulated in Krein space, see [6], [7]. Then, it has been 
shown that the V-block risk sensitive Riccati mapping is 
strictly contractive for the Riemann metric by choosing the 
risk sensitivity parameter sufficiently small. Regarding the 
risk sensitive like Riccati iteration, it seems to converge [12], 
but no convergence result has been proved yet. 

In this paper, we consider a similar contraction analysis 
to prove that the risk sensitive like filter in [12] asymp¬ 
totically converges for tolerance values sufficiently small. 
More precisely, we formulate the filtering problem in Krein 
space and then we show that the N-block risk sensitive like 
Riccati mapping is strictly contractive provided that the time 
varying risk sensitivity parameter is smaller than a constant 
parameter. Moreover, it is always possible to find a lower 
bound of this iteration after a finite number of steps. As we 
will see, both the constant parameter and the lower bound 
allow to characterize a range of values of the tolerance for 
which the iteration converges. 

The paper is organized as follows. In Section]^ we review 
the risk sensitive like filter presented in [12]. In Section 


III we review the Thompson part metric and contractive 


mappings needed for the following contraction analysis. In 
Section EYI we construct the A^-block risk sensitive like 
Riccati mapping. In Section [V] we characterize a range of 
values of c for which the mapping is a strict contraction. 
Finally, in Section VI we provide an example. Throughout 
the paper, V denotes the cone of positive definite symmetric 
matrices, and V its closure. Given P gV, Xi{P) > ^ 2 iP) > 

... Xn{P) > 0 are its eigenvalues sorted in decreasing order. 

II. Risk sensitive like Filtering 


Consider a discrete-time stochastic process yt described 
by a nominal Gauss-Markov state space model of the form 

xt+i = Axt + But ( 1 ) 

yt = Cxt+Dvt, t>0 (2) 




where the state Xt S K", the process noise ut G M™, and 
the observation noise Vt G The noises ut and vt are 
assumed to be zero-mean WGN processes with normalized 
covariance matrices and independent, that is 


Ut 

r 1 



0 


[Us Vs \ 


0 



where St denotes the Kronecker delta function. The initial 
state vector xq is assumed independent of the noises ut and 
Vt with nominal probability density 

Po{xo) '^Af{0,Po). 


with initial condition Pq. The mapping r^( ) is defined by 

r^{Pt) = A[Pt-^ + C^{DD'^)-^C - 9t-iI]-^A^ + BB'^. 

Note that, the robust filter is a risk sensitive like 

filter, [17], [16, Chapter 10]. In the classic formulation, 
however, the risk sensitive Riccati mapping is defined as 

rf^(P) = A[P-^ + C^{DD'^)-'^C - 01]-^ + BB^ 

where the risk sensitivity parameter 0 > 0 is constant and 
does not depend on P. Moreover, for 9 = 0 (risk neutral 
case) we obtain the Riccati mapping 


The pairs {A,B) and {A,C) are assumed to be reachable 
and observable, respectively. Moreover, we assume that the 
noises Ut and Vt affect all the components of the dynamics 
® and observations that is BB^ and DD^ are positive 
definite. As observed in [12], this is a natural property to 
demand when the relative entropy is used to measure the 
proximity of statistical models, see below. 

The robust filter proposed in [12] is designed according 
to the minimax point of view. More precisely, at time t, the 
nature selects the least favorable increment of the state space 
model in a ball about the nominal increment given by Q-®. 
Such a ball is characterized by requiring that the Kullback- 
Leibler divergence, [4], between the two model increments 
is smaller than or equal to the tolerance c > 0. Note that, c 
is fixed by the user. More precisely, the larger c is, the worse 
increments the nature can select. 

It turns out that the robust estimator Xt+i of Xt+i, given 
the observations yt,yt-i, ■ ■ ■ ,yo, obeys the Kalman like 
recursion 

xt+i = Axt + Ktvt , ( 3 ) 

where 

vt = yt- Cxt (4) 

is the innovations process. In ®, the gain matrix 

Kt = - 9t-iIn)-^C^{R!'t)-^ , (5) 

where 

i?r = E[vtiyf] = c(pr' - et-iinr^c^ + dd'^ (6) 

represents the variance of the innovations process, 9t-i with 

0 < 9t-i < (Ai(Pt))”^ (7) 

is the unique solution to the equation 

7(0t_i,P*)=c (8) 

where 

7(0, P)^\ [log det(/ - 9P) + tr[(/ - OP)-^] - n] , 

(9) 

and if xt = Xt — Xt denotes the state prediction error, its 
variance matrix Pt = E[xtxf] obeys the distorted Riccati 
iteration 

Pt+i = r^{Pt) (10) 


r(P) = A[P-^ A C^{DD'^)-^C]-^A^ + BB^. 

Finally, it is worth noting that (|^ implies that r^{P) G V 
for each P G V, that is r^(-) is a mapping of V. Such a 
property does not hold for the classic risk sensitive mapping 
because it may occur that r^^{P) ^ V even when P G V, 

[13]. 


III. Thompson part metric and contraction 

MAPPINGS 

If P is an element of V with eigendecomposition 

P = UKU'^ (11) 


where U is an orthogonal matrix formed by normalized 
eigenvectors of P and A = diag {Ai,..., A„} is the diagonal 
eigenvalue matrix of P, the square-root of P is defined as 

pl/2 ^ 

where is diagonal, with entries A^^ for 1 < i < n. 
Similarly, the logarithm of P is the matrix specified by 

log(P) = C/log(A)[/^ , 

where log(A) is diagonal with entries log(Ai) for 1 < i < n. 
Let P and Q be two positive definite matrices of V. Then 
P~^Q is similar to so they have the same 

eigenvalues, and is positive definite. The 

Thompson part metric between P and Q is defined as 

dT{P,Q) = ||log(p-i/2gp-i/2)||2 

= max{log(Ai(P"^Q)),log(Ai((3"^P))} , 


where || • ||2 denotes the spectral norm. 

Let /(•) be a non expansive mapping of V. Its contraction 
coefficient (or Lipschitz constant) is defined as 


iif) = sup 

P,Qev,P^Q 


dT { fiP)jm 

dT{P,Q) 


( 12 ) 


From ([T^ we get 


dT{f{p)jm<af)dT{p,Q). 


Moreover, if ^(/) < 1, then / is a strictly contractive 
mapping. 

If / is a strict contraction of V for the metric dr, by the 
Banach fixed point theorem [1, P- 244], there exists a unique 
fixed point P of / in P satisfying P = /(P). Moreover, 









if the TV-fold composition of a non-expansive mapping 
/ (or simply -block mapping of /) is strictly contractive, 
then / has a unique fixed point. Furthermore this fixed point 
can be evaluated by performing the iteration Pk+i = f{Pk) 
starting from any initial point Pq of V. We will consider in 
particular the Riccati like mapping defined by 

f{P) = M[p-^+n]-^M'^+ W (13) 

where P, and W are symmetric real positive definite ma¬ 
trices and M is a square real, but not necessarily invertible, 
matrix. For this mapping the following result was established 
in [11, Th. 5.3]. 


Lemma 3.1: The mapping in ( [T3| l is strictly contractive 
with 


m < 


1 + 


(14) 


In the model ([T7])-([T9ll 

TZn = [ B AB ... A^-^B ] 

On = [ {CA^-Y ■■■ (CA)^ ^ 

0 § 4 [(^^- 1 )^ ... {Ar if 

Vn = In® D. (20) 


Note that TZn and On denote respectively the A^-block 
reachability and observability matrices of system Q-Q, 
where the blocks forming On are written from bottom to 
top instead of the usual top to bottom convention. If the 
pairs {A, B) and (C, A) are reachable and observable, TZn 
and On have full row rank for > n. In (18 1 and (191, if 



CA^-^B t > 1 
0 otherwise 

A*-^B t > 1 

0 otherwise 


Lemma 3.1 is the key result we will use to prove that iteration 
( [T0| l converges for any Pq € 'P, and thus the risk sensitive 
like filter converges. 

IV. A^-block risk sensitive like Riccati mapping 

The robust filter 0-(i can be interpreted as solving 
a standard least-square filtering problem with time-varying 
parameters in Krein space. The Krein state space model 
consists of dynamics Q and observations Q, to which we 
must adjoin the new observations 


R 


0 = Xt + v. 

The components of noise vectors ut, Vt and uf 
to a Krein space and have the inner product 


(15) 

now belong 


/ 

Ut 


Us 

\ 

Ijn 

0 

0 

( 

Vt 

5 

Vs 

> = 

0 

Ip 

0 

\ 

. . 


. . 

/ 

0 

0 

-{Ot-lYln 


(16) 

Note that, in the classical risk sensitive framework, [6], [7], 
with f > 0 are identically distributed, whereas are not 
in this setting. Since Xt is Gauss-Markov, the downsampled 
process x'l = XkN, with N integer, is also Gauss-Markov 
with state space model 


''k + l 

Yk 


A^x'l -f IZnY 


IZn^u 

d I TS I nj 


— ONXf^+PN'^k PP-N'^k 


0 = 


/r\R d I ^y.RN I n N 

Y'N^k P^k P pN^k 


(17) 

(18) 
(19) 


where 




[ '^kN+N-l 

T 

'^kN+N-2 

■ ■ ■ '^'kN 
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T 
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■ ■ ■ '^kN 

yf 


[ VkN+N-l 

T 
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■ ■ ■ VkN 
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= [(■ 


kN+N 
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[V 


R \T 
kN> 


Hn and £ at are block Hankel matrices defined as follows 
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The Krein space inner product of the observation 

vector 









[ wf 1 


r vnY 1 
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Pn 

11 ^ 




[ vr J 

H" 

Cn 



admits the following decomposition 



[ ^k 1 

PnPJ^ 

0 

0 

— {P>N,k] 


Inp 


^k 

,RN 


-1 


Pn 

Cn 


[P 


CT' 

l-N 


0 

Inu 


PnP'n{Pn'P'n PPnP'n) ^ 

PnP'n P PnPn 0 

0 

Inp {VnPn PPnPn)~^PnPJ[ 


0 


iNn 


where 


A 


( 21 ) 


SeNk=~i^N,k) '^pPNilNmPPNi'PN'PN) ^Pn) ^ Pjf 

( 22 ) 




































denotes the Schur complement of the (1,1) block inside 
/Ce«. with 


A 


©Af.fc = diag{9kN+N-2, 9kN+N-3, ■ • ■ , dkN-l) <8) In- (23) 

The projection of noise vector on the Krein 

subspace spanned by the observation noise vector 
[ i'^k)'^ ] is then given by 


= [ G, ] 


^k 

,NR 


where 


[G, G§] = [nl C%]{ICe.J-\ 

and the residual has for inner product 

Qe,,, = K,n^) 

= lNm-[ Gk Gf ] /Ce, 


Gl 


(G^r 


= [iNm + Hn(,'II’nII>n) ^Hn — Gj^QN,kGN] ^ ■ 

Then by multiplying the observation equation obtained by 
combining equations (18 i and (19 1 by TZtq [ Gk G^ ] and 
subtracting it from (VI), we obtain the state space equation 


Xfc+i = aN.kxl + HN^k + ^NGkYk 


rN 


with 


aN,k =A^ - TZN[GkON + GjlO^] 


(24) 


where the driving noise is now orthogonal to the noises 


wj, and appearing in observation equations (18i and 

Accordingly, the time-varying Riccati iteration associ¬ 
ated to the downsampled model takes the form 


pa — r-a( pa\ 
-^fe+1 'c\-^k) 


(25) 


where 


riiPk) = c^nA{P^)-^ + 


= [ Ol {O^r ] 


-1 

Nyk 


On 

(jR 




(26) 


with 


A 


Jn = O^-CNHkpNPh + PNHkrON 


A 


and 


Hn = okiVNVh + PNnkr^ON 




( 27 ) 


V. Convergence of the risk sensitive like filter 

In this Section we show that the filter asymptoti¬ 

cally converges, or equivalently iteration ( fl^ converges for 
any Pq G V, for tolerance values sufficiently small. 

The idea is to find an upper bound for c, say cmax, for 
which the Gramians ^ and IkeN & positive definite 
for k > q and q is a fixed integer number. Then, by Lemma 


3.1 r(:(-) is a strict contraction for k > q. Since r^(-) is the 


A^-block mapping of r^(-), we conclude that iteration (101 
converges. 

Note that, ^ and TLo^ ^ depend on the positive 
definite matrix QN,k- It is not difficult to see that, a sufficient 
condition to guarantee positive definite for 0 A <1) ^ 
(’nInu, and thus also for N > n, is that 

1 


(‘N = 


\liCN{lNm + n%{VNVj,)-^nN)-^cl) 


> 0 . 


Such a condition also guarantees that is negative defi¬ 
nite. 

Lemma 5.1: Let $i and $2 be such that 0 ^ $2 ^ 4>i ^ 
^nInu- Then, 


n 


<i>i ^ 


A 


n $2 


(28) 


Note that 


N 




I^^U=0 = P-NilNm +PNiPNPN) VLn) '^-n 

which are positive definite matrices for N > n because 
the pairs (G, A) and {A, B) are observable and reachable. 


respectively. Accordingly, in view of Lemma 5.1 there exists 
a constant (pN > 0 with N > n such that 


W<j> A 0, VO ^ ^ PnInt, 


(29) 


As noticed before, 1L$ is positive definite over the range 
0 ^ 4) -< (pnlNn- Then, we set (pN = 4 >n ™d check whether 
^4‘NiN-n positive definite or not. If not, we decrease pN 
up to becomes positive semi-definite but singular. 

In this way, (291 holds. Therefore, if QN,k A PnInu for 
k > q, then the Gramians and Wq^ ^ are positive 

definite for k > q, and thus iteration converges. 

Now, we characterize an upper bound for c, say cmax, 
which guarantees that QN,k A PnInu for k > q. To this 
aim, we need the following two Propositions. 


Proposition 5.1: Consider iterations ( [T0) l and (25 1 with an 
arbitrary Pq gV. Moreover, consider the iteration 

Pt+i=r(Pt), Po=BB^. 

Then, after a finite number of steps, say q -f 1, we have 

Pth Pq, t>q + l, 

and after q = [ steps 

Pk hPq, k> q. 




















Proposition 5.2: Assuming that 0 < 6 » < {XiiP))-\ the 
following facts hold: 

1) is monotone increasing over IR+ 

2) 7 ( 0 , P) > 0 for any P gV with P 7 ^ 0 

3) If PPQ then 7 ( 0 , P) > 7 ( 0 , Q) 

By Proposition |5.l[ Pt P Pq with t > q + 1 and q is fixed. 
Then, by Proposition |5.2[ (|^ implies that 

9t-i <0, \/t> q+1 

where 9 is such that '){9t-i,Pt) = 7 ( 0 , Pq). Thus, condition 
0Af,fc < for k > q is guaranteed if we choose c in 

such a way that 9 < (/jat. The idea is formalized in the 
following Proposition. 

Proposition 5.3: Let c be such that 0 < c < cm ax with 
Cm AX = Pq), N ^ ti, and q fixed. Then, the mapping 

r^(-) is strictly contractive after steps. Accordingly, 

iteration ( [TOl i converges to a unique solution for any initial 
condition Pg G P. 



Fig. 1. Minimum eigenvalue of over t/ijv S [0i 8'10 ^]- 


Note that, by Proposition |5.2| the map 

q i-A j{(j)N,Pq) 

is nondecreasing. Thus, we have to choose q sufficiently large 
in order to find a bigger cmax- 


VI. An Example 


We consider the Gauss-Markov state space model earlier 
employed in [13] 


A = 


0.1 1 
0 1.2 
C=[l -1] 


B = l2 
D = 1 


(30) 


with n = 2, m = 2 and p = 1. We select = 8, in this way 
N > n. Note that, larger values of N can be considered. We 
find that = 1-6• 10“^. In Figure[^we depict the smallest 
eigenvalue of over the range G [0, 8 • 10“^]. We 

find it becomes zero for (j)N = 1.3 x 10“^. 

In Figure]^ we^epict 7 ( 0 , Ao), 7 (^)Ao) ™d 7 ( 0 ,P 35 ). 
Note that, 'y{(j)N,Pio) = 2.9 • 10“^, ^{4>n,P2o) = 4.39 • 
10 “^ and "/i4>N, P 35 ) — 5.43 • 10 “^. As expected, it is 
better to choose P 35 for which we have cmax — 5.43 -10 
We conclude that the risk sensitive like filter ([^-([^ having 
tolerance parameter c < 5.43 • 10“^ and nominal model (301 
asymptotically converges to a unique solution. 


VII. Conclusion 

We analyzed the convergence of a risk sensitive like filter 
subject to an incremental tolerance. By contraction analysis, 
we showed that the corresponding 7V-block risk sensitive 
like Riccati mapping is strictly contractive for tolerance 
values sufficiently small. Accordingly, the corresponding 
iteration converges to a fixed point, and thus the robust filter 
converges. 



9 X 10 '' 

Fig. 2. 7 ( 0 , Pq) with q = 10, q = 20 and q = 35. 


Appendix 


A. Proof of Lemma 5.1 


Let $ and A belong to V and P, respectively, and such 
that 0 A $ + A ^ fniNn- The first variation of 5^^ and 
Qq, with respect to $ in direction A are, respectively. 


and 

<^Q3>;A = Q$P^A£jvQ3> a 0 . 


This implies that 


c-l 

*3$ 




S' 


-1 

$2 


Q<i>i A Q‘1>2 













































with 0 ^ $2 ^ ‘&1 ^ 4>NlNn- Accordingly, 

A 7?,7vQ$2'^w = ^$2 ■ 


B. Proof of Proposition 5.1 


Before proving the statement, we need the following two 
Lemmas. The hrst one regards a risk sensitive mapping 
property, [ 8 , page 379]. 

Lemma 1.1: Let P G V such that P~^ — Oil G V and 
6 »i > 6*2 > 0. Then, 

r^fiP) h rg^{P). 

Lemma 1.2: Consider the sequence 

Pt+i=rf(Pt), PohBB^. 


where we exploited the fact that {I — OP) 2 and —I + (/ — 
9P)~^ commute. ■ 


D. Proof of Proposition |5.3| 

Consider iterations ( [TOl l and (25 1 . As showed in Propo¬ 
sition |5.H after a finite number of steps, that is g + 1 and 
q = \ , we have, respectively. 


Pt A Pn 


-z r-q, t > g + 1 


Pk h Pq, 


k> q. 


Since c < j{4>nj Pq), by Proposition 5.2 we have 9t < 4 >n 


for t g P 1 and therefore 0 iv,/c P fN^Nn for k P q. 
Accordingly, the Gramians, a nd lTe„ j, are positive 

definite for k > q. By Lemma 


3.1 


the mapping rff) is 
strictly contractive after g steps. Since rff ) is the A^-block 
mapping of r^{-), it follows that the sequence generated by 
converges for any Pq G P. ■ 
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Then, 


PthPq, y t> q. 


Proof: We prove by induction that 
PtPPi, Vf>0. 
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